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A Simple Direct Proof of Post's Normal Form Theorem 

by Marvin Minsky 


The theorem proved in this note is the Normal Form Theorem proved 
in Post's 1943 paper, "Formal Reductions of the General Combinatorial 
Decision Problem". We have long felt that this result is one of the 
most beautiful in mathematics. The fact that any formal systems can 
be reduced to Post canonical systems with a single axiom and produc¬ 
tions of the restricted form 


QfOi 


otr 


is in itself a remarkable discovery, and even more so when we learn 
that this was found in 1921, long before the formalization of meta¬ 
mathematics became so popular. Post's proof of the theorem is quite 
readable. Our proof is somewhat simpler, and quite a bit shorter. 

Its brevity is a result of doing the necessary operations with less 
concern about the order in which they are done. Some clarity is 
gained because this technique requires fewer auxiliary symbols. 

Some clarity is lost because of the superposition of operations and, 
accordingly, we resort to illustrating the process by a detailed 
example (which brings the writing back to roughly its original length). 
Still, the techniques used here are somewhat different and give, per¬ 
haps, a better feeling for why the theorem is true. 


The pap ex ' 1 concludes with a modest contribution—a very simple 
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"Universal Canonical Extension" based on the same techniques, 
end up with a "Universal Canonical Extension" with three lotto 
(minimal in a sense) and four productions (minimal v ). 




A Simple Direct Proof of Post's Normal Form Theorem 


We first prove the theorem for systems having single-antecedent 
productions. Suppose that we have a Post Canonical System P for which 
a typical production has the form 
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where each variable string a. is one of the a. and cr , ..., rr 

i* 1 o 

r , T m are given fixed strings. This production will apply to any 

string which has the form of the antecedent—that is, any string which 

begins with o q , contains non-overlapping occurences of cr,, er^, ... er n _^ 

(in that order) and ends with cr . We will construct a new Normal system, 
an extension P* of P, by replacing each production t by a set of new pro¬ 
ductions using some new symbols. The task for the system P* is (i) to 
verify that a string S can be so analysed as an antecedent of v and then 
(ii) to assemble a consequent string composed of the r-strings with the 
proper strings for cc. , ..., O' inserted in the correct positions. The 
problem is to do all this using only normal productions--productions of 
the form 


(TOL 

The difficulty is that normal productions can "look" only at the beginning 
of a string* they cannot examine a string's interior to see, e. g., if 
it contains an occurrence of cr,. 

We can evade this limitation by "rotating" the string so that suc¬ 
cessive symbols are brought up to the beginning. The following produc¬ 
tions will thus test whether, in an initial string Tof, the unknown 
string Of has the required antecedent form. Suppose the letters of the 
original alphabet are a,, a 2 , ..., a . We introduce new letters T,,T 2 , 
..., T n and two more, Z and Y. (We use upper-case letters for all new 
symbols used in the extension P*.) 


To- a ”* ofZT, 
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a .a —*■ Ofa . 
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= 1,..., r) 

T. a .a -+ ofa ,T. 
i J j i 
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Vi 0 - <* 1+ i 
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To see how 

this works. 

we study an example. 

a^oe a^a^oc^a 

_ a^oua,, a 

^a n c^a^a^a^ a n a rl 

3 i 2 5 2 
is, 

13 3* 

2 1 3 2 2 31 5 

= a 3 

T o " a 2 a l 

a. =a~ n = 3 

X 1 ^ 

ff l = a 2 a 5 

T 1 * a 2 a 2 

a. = a_ m = 2 

i 2 3 

<t 2 = a i a 3 

t 2 ‘ a l a 5 



Suppose that ir is 


<r 3 = a 4 

Suppose that the actual string S is, in fact, 
a 3 l a l a 2l a 2 a 5 K a 2l a l a 3 l a 2 a ll a V 

which is in the form of the antecedent of tt. Then the following strings 
are produced: 

T a J a i a 2 a 2 a 5 a 4 a 2 a l a 3 a 2 a l a 4 
laJaga^^a^aga-^a^aga^ajiZ 

| |a 2 a 2 aca^ . . . a^a^Z T 1 a 1 

| I a ? a 3 a^ • « « a^a^Z s*| stp 


a,a^Z T,a,a 2 


Z ^ a x a 2 a 2 a 5 a 4 a 2 a l a '3 a 2 a l a 4 
T^a^a^a^* ...... a^Z 


a 2 a 2 


a 4 z a l T l 


^l a 2 a 2 a 5 a 4 # .... aj^Z a ^ 

a 2 a 5 a 4* .... a^Z a 2.^2^X 

I pi a 2 a 5 a 4 ***** a^a 2 • 

Now there is a choice, for the system can apply either 


or 


T l a 2“ “ 
l ^l a 2 a 5° ! 


“Vl 

— aT„ 


Thus two strings are produced; 


and 


a^a^ • • # • a^Z a^ stpap^T-j 

a^. . . . a^Z a-^a 2 T 2 . 
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The first string is ultimately doomed, because there will never again 
be an opportunity to eliminate the symbol T, from it—so it can never 
yield a string in the original alphabet of P. But the second string 
can continue on to 

Tga^aga^a^aga^a^Z a l a 2 

a 2 a l a 3 a 2 a l a 4 Z a l a 2 a 4 T 2 

^2 a 2 a l a 3 a 2 a l a 4 Z a l a 2 a 4 

a l a 3 a 2 a l a 4 Z a i a 2 a 4 a 2^2 

Tga^a^aga^a^Z a-, apa^ap ♦ 

Again there is a choice, producing 

a^aga^a^Z a^aga^aga^Tg 


and 


a 2 a l a 4 Z a l a 2 a 4 a 2^3 


Again the first string is doomed, 
but the second string leads to 

Ta n apa^stp 

a l a 4 Z a i a 2 a 4 a 2 a 2 T 3 


since Tg will never be eliminated. 


T3 a i a 4 z a^aga^agap 

a^Z a^aga^agaga-^T^ 

T^a^Z a*|aga^agapa-, 

a l a 2 a 4 a 2 a 2 a l"¥ 

A string containing a Y can be produced only if the original 
string had the required antecedent form! This shows how we can com¬ 
pound normal productions to analyse into the interior of a string. 

We have shown how normal productions can be used to determine 
whether or not a string has the antecedent form. But we cannot use 
our result to construct the consequent , because the resulting string 
(containing 'Y') has the a-strings "run together". That i^ we have 
destroyed information needed to show how the string 

jfj ^ 2 ^ 2 . 

is composed of three strings a^, and a^. To correct this we present 
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a revised (and final) version of the productions above. 
Productions for the Normal Extension 

The complete method requires a new set of symbols 

A 3* A l A l 


JU 

1* 

A 2 , • • 

A 
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A 2 
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1» 
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These are used to keep track of the contents 
. , a . The new productions are 

To- a -*• a Z T„ 


of the strings a^, a , 
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Z a 
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a Z 
a Y 
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a .a “♦ aaj -(j = i, 

Vi 1 * °®i+i- (1 = 1 - 

T t a ,or* aA jTj- --(1 - 1, 
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(all i, j) 


T a ZortxQ. 

n n 


' .— ■ I ^ 

The only change concerns the symbols A.. If the reader retraces the 
example above he will find that the final result will be 

_ .'TTT ^2^2. 3 I 3 

Q A 1 k 2 ^^2 A 2 A l 


so that now, by inspecting the superscripts, one can tell that 

a l ” a l a 2 , a 2 = a 4 a 2' anc * a 3 “ a 2 a l* All the required information has 
been preserved, so we can go on to construct the consequent. 

Before going on, observe that if the original string can be resolved 
into antecedent form in several ways we will get a distinct Q-string for 
each. Thus if we start with the string 

Ta 3 a 1 a 2 a 5 a 2 a 2 a 5 a 1 a 3 a 2 a 4 

in our previous example, we can analyse it either as 


or as 


g ol a ll a l [ a 2 a 2 a 5j °2 [ a 2r 3 

Oq ( a-j^aga^agj [a^ | ^ 3 • 


Therefore the system yields the two Y-strings 
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and 


Q aJa|a|a \ k \ 

0 A^A*^ A^” A^* A^ 

XI*| «V^AA ^ «lgiAg 0 


In the latter, a 2 is a null-string, which is perfectly all right. 

Now we construct the consequent. We will use the letter Q to 

initiate the process by setting-up the consequent form into which the 

X n 

-strings are to be copied. We will use new letters V x ,...,V to mark 
the places corresponding to the variables i n the consequent. 

The consequent form is introduced by the production 


Qor 


ar 0 V il T 1 V 12 T 2 . . . V 1 ™^ Y, 


where r Q , . ..,t are the constant strings of the consequent of ir. A set 
of produ ctions _. 


A j a tc a 


“Vj 


allow the A's to move freely to the right, passing over letters in 
the original alphabet. We do not allow the A's to pass one another, 
hence their order is always preserved. We also allow the A's to pass 
over some of the V's: 


_ i Tr k 
A .V or 

J 


ort^Aj-(if M k). 


Hence the A's ignore V's with different superscripts. But when the 


superscr ipts agree, we use 

. i TT i _T „i 1 
A.V or -*• ckV a.A.. 

J J J 


Hence, whenever an A. passes over a V with the same superscript— 
corresponding to the same variable string or. — it leaves there a copy of 
that letter a. which caused that A-symbol to appear. The result is 

1 j 

that when all A's have passed over all V's, each V will be followed by 
a copy of the corresponding arstring. To see this in our example, 

Q A^A^A^ “ 

A^A^A^AlAlA^a^a^V^a Z Y 

5 

» « • • 

A l A 2 A 4 A 2 A 2 a 2 a l Y ^ a l a 2 a 2 Y ®l a l a 5 A l Z Y 

0000 " " 

a 2 a l V ^ a 2 a l a 2 a 2 V ^ a 2 a l a l a 5 A l A 2 A 4 A 2 A 2 A l Z Y 


and we see that a copy of o:^ ^a^ag has been inserted to the right of 
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each occurrence of V 






When the string reaches this form, all the work is done and it 
remains o nly to elimina te the auxiliary letters of P*. The productions 

AjZa -+ aZ 

remove the A's, while the productions 

Ya .a -*■ ora .Y 

Y V 1 cr+ aY 

erase all the V's. Now all these events can occur in many different 


orders. However the net effect is always the same, once Y arrives at 
the immediate left of Z. Por then all A's have been erased and also all 
V's. Whe n this happ ens we apply, finally, 

YZa -► orT 


yielding theultimate desired result 

Tt a. T-, or. ... or. r . 

o i, 1 i 0 i„, m 

12 m 

Observe that Y cannot pass over an Ay Therefore, if Y is to move over 
to Z it must "push" the A's ahead of it, and hence all A's must even¬ 
tually pass over all V's. Y erases V's when it comes to them, but 
clearly it cannot erase a V until it gets to it, and to get there it must 
have pushed all A's past that point. Thus no V is erased until it has 
thus done its work of copying. 


Problem: Prove the theorem using productions a®”* <*t in which neither 

<t nor t have more than two, and not both have two, letters. Do not 
hesitate to use great quantities of letters. 

Completing the proof 

There remain a few loose ends. Observe first that we have not 
really produced a legitimate extension P* of P, because P* does not 
produce any strings which have only letters of P. Proof of this 
statement: The axioms of P* all contain the upper-case letter T. Every 

production which involves an upper-case letter in its antecedent constant 
string produces at least one such letter in its consequent string.. There¬ 
fore, (by induction) each produced string contains an upper-case letter. 

Obviously, the strings we wish to "detach" are those which start 
with 'T f In fact any string Ta can be interpreted as an assertion 
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that a is a Theorem of the system P. It would be tempting simply to 
introduce a production 

Ta “+ a 

to remove an initial T. The trouble is that this would lead to 
spurious strings in the P-alphabet, because of the productions 

apL-^aj 

which are already part of the system. Thi3 shows, in fact, that there is 
no way to obtain the desired result in our system, because we will 
always get "rotated" strings with legitimate strings. The cure requires 
a new alphabet. Suppose that the alphabet of P is really b.,...,b r 
and that a-,,...,a are really new letters of P*. Then we can "detach" 

V 

•T 1 by the productions 



The trick depends on the fact that we do not allow the b's to rotate 
(there are no productions b^a ab .) 

Of course these productions make it possible for an a^ to be con¬ 
verted to a b. at any step of the process, but when this happens 
prematurely it only yields "doomed" strings which are not pure strings 
in P and cannot yield any of same, once the 1 b' letter rotates to the 
front of the string. Only in the case that the conversion begins with 
*Ta.’, so that the resulting string comes to begin with 'R', can the 
(otherwise) always-present upper-case letter be removed, for 

Ra a 

is the only production of the entire system that can eliminate the 
auxiliary upper-case letter. 

Another loose end is involved in the case of systems with more 
than one production. Here we simply carry out the whole above construc¬ 
tion for each production, using entirely new letters for all upper¬ 
case letters except T. Then the productions can operate independently, 
only linked by T so that each system can operate on the final results of 
the operation of other systems. 

We need one more theorem to complete our proof that the Canonical 
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systems of Post, in their full generality, can be replaced by Normal 
Extensions. We have to account for the case of productions which 
require several, rather than just one, strings in the antecedent. The 
proof of this, below, shows that this most general system can be reduced 
(in the form of a Canonical Extension) to the case of the single-ante¬ 
cedent production. Having shown this, the result of the present section 
completes the proof of the general theorem. We feel that the method of 
the following proof is instructive in itself, if only in showing how 
it may be useful to employ, for theoretical use only, a kind of reck¬ 
less extravagance. 

Reduction of multiple-antecedent productions to single-antecedent 
productions 

We consider now the most general form of Post's Canonical systems, 
in which a production may require more than one existing string in order 
to produce a new string. A most classical example of this occurs in 
elementary logic where from A and A B one produces B. In the most 



where the <r's and t's are given constant strings and each is one 


of the a 


Below we show how to make a canonical extension P* of P which has 


only single-antecedent productions. Let a,,...,a be the alphabet of 

P. Let be the strings which are the Axioms of P. Our new 

P 

system will have but one axiom: let 'A' be a new symbol. 

AA^AASgAA . . . AA$ p AA 

(There would be little point to having several axioms now, since with 
single-antecedent productions there could be no combining of their sep¬ 
arate consequences.) Corresponding to the old production tt we introduce 
one new symbol X and the following monstruous production tt*: 


9. 



This has the effect of analysing a string to see if it contains sub¬ 
strings which have the forms required of the antecedents. If it finds 
such strings, sandwiched between A's, and if they occur in the given 
order--an undesirable requirement to be lifted shortly--it adjoins the 
resulting consequent, and some other material. 

Why cannot we simply adjoin the consequent? The reason is that 
the analysis may be spurious t in that one of the j' s can contain too 
much—parts from more than one proper P-string. This will be the case 
if any a^. contains one or more 'A's in its interior. This is why our 
production w* finally appends all the discovered variable strings in 
the form 


Xa 


11 


* a Sn s X * 


Note that the form does not end now with an A, so that tr* can not apply. 
We now check to see if this appendage contains no A's: 

[ c^Xa .of 2 X -*■ a.,Xc* 0 X I (j = l,...,r) 


O' Xa 2 X 


O' XX 


0l AA 


Then the string will return to standard form only if there were no A's 


in any of the selected a^.'s. 

The result is that as new strings are produced, in accord with the 
productions of the original system P, they are appended (between 'AA's) 
to the axiom string. Any assertion of P is ultimately so generated, 
and appears preceded by its entire "production history". 

An extension system is required to yield the original strings with¬ 


out any auxiliary letters. We do this by a similar method: 
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Aa, Aa~Ao' A 
1*3 

a l Ya j a 2 Y ”* 


-Ya 2 Y 
a, a .Yor Y 

1 J 2 


&YY 


a 


(j - 


The result is that any P-string which occurs between A's in a 
standard form P*-string will be released without auxiliary letters. 

One loose end. The production ir* requires the antecedents of tt 
to occur in a given order. There is no meaning to such a restriction 
in P. To eliminate it we could introduce (rather than just ir*) a distinct 
production like ir* for each possible order of the antecedents--that is, 
we need S! forms of m* with the antecedent parts permuted. This would 
complete the construction. A more elegant solution: we adjoin the 
single production 


Aa^AAo'gAAa.-A 


A« 1 AAa 2 AAa AAttgAA. 


This makes it possible to take any interior P-strlng and append it to 
the end of the compound P*-string. Thus the required antecedents can 
be placed at the end in any order required by a ir* production. 
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Part II 

A "universal" canonical extension 


It Is very easy to construct an analogue of a Universal Turing 
machine in the form of a canonical system. Let P be an arbitrary Normal 
system with axioms $,,...,<1 J S --and productions a^or* ar 1 . Let a 1 ,...,a r 
be the alphabet of P, and let A, C, S, and T be new letters. Our 
system ^TpWill have the single axiom 

ACA(r 1 CT 1 Arr 2 Cr 2 . ..A^Cr^S^S^.. .S$ a STTT. 

Observe that this axiom is a complete description of P, since from it 
one can reconstruct all the axioms and productions of P. Our productions 

k f 

are:* 

tt 1 a 1 AaCTAa 2 SpaSo' TTT “► a-LAffCrAagSPaS^aTSTfraTTparTofr 



a l Ta j 0 ^ Ta i a 3 To! 4 


a,TTTa 
o', TTTa 



2 


or^To^To' To^ 


olTTT 


O', 


(j = 1 ,... ,r) 


We assert that this system is a canonical extension of PI The 
remarkable aspect of this is that the productions ofVp do not depend 
in any way on the structure of P (except for the restriction on the 
alphabet, which we will dispose of shortly). Thus* 7 !/ p is like a 
Universal Turing machine in that, given (as its axiom) a description 
of an arbitrary canonical system P, the productions of u will generate 
precisely the theorems of P. 

To see that *l/pis a canonical extension of P, let us examine the 
effect of tt^. ir^ "looks" for a substring AcrCrA and a substring S(3aS 
and "attempts" to adjoin the string or in accord with some production 
of P 


(T. a — (XT. . 
i i 

This operation is valid only if (l) 3 = o and (2) if the strings a, p, 
or, t do not contain upper-case letters. For if a string has the form 


* Here, all Greek letters represent variables. 
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ActCtA with <T and r having no upper-case letters, then rrand rmust be 
some cr. and r ± . The productions ir 2 check both conditions, for they 
remove, one at a time, letters from^aaT and par, but only if the first 
letter of each is at each step the same. Both strings can vanish only 
if (r = p. Then production ir? can apply, adjoining the produced string 
(XT.- as though it were a new axiom of P. Production w ^ detaches the 
new string from all upper-case letters, to yield a legitimate theorem of 
P. 

We put the Initial 'AC' in tt, so that the system would be sure to 
produce the axioms of P, in accord with the production 

a —► a 

which may not be included among the productions of P. 

Now for the alphabet problem. We simply encode P's alphabet a,,. 
..,a r into a two-letter alphabet a, b: simply let 'a.' be represented 
by a 'b' followed by . 'a's. Then everything works as before, and we 
need only two productions iTg and i for^V. So ends up with just 
five productions. It no longer is an extension of P, but it is (with 
encoded axiom of P) an extension of a trivial re-coding of P. 

Problem 1 . Three letters is minimal for a Universal Canonical Extension. 

(Why? ) But we can construct a version of *1/ with only four productions. 
Hint: A simple change in makes tt_ unnecessary. Solution on next 

page. 


I can't see how less than four productions could suffice, for we 
need one to "do the work", two to check the alphabet conditions, and one 
to "release" the pure string. Such reasoning, however, usually turns 
out to be unsound I 

Problem 2 . Construct a version of with only one extension letter, 
still using only four productions. (Easy) 

Problem 3 . Construct a version of V using only normal productions. 
(Laborious.) 

Problem 4 . Using the analogy with a Universal Turing machine, construct 
an unsolvable decision problem forV. Consider the prospect of develop¬ 
ing the theory of computability on this basis, rather than on the Turing 
basis. 












CS-TR Scanning Project 
Document Control Form 


Date : H / 30 ft 5 


Report # AJ H 1 / 

Each of the following should be identified by a checkmark: 

Originating Department: 

Artificial Intellegence Laboratory (Al) 

□ Laboratory for Computer Science (LCS) 

Document Type: 

□ Technical Report (TR) Technical Memo (TM) 

□ Other:__ 

Document Information Number of pages: 

Not to include DOD forms, printer intstructwns, etc... original pages only. 

Originals are: Intended to be printed as . 

^ Single-sided or ^2^ Single-sided or 

□ Double-sided ^ Double-sided 

Print type: 

□ Typewriter □ Offset Press □ Laser Print 

□ HMPM. □ *■■■«fy Q F /hi <*» <?«*** 

Check each if included with document: 

□ DOD Form □ Funding Agent Form □ Cover Page 

□ Spine □ Printers Notes □ Photo negatives 

□ Other:__ 

Page Data: 

Blank Pages/* page numbed. 

Photographs/Tonal Material (by page numbed. 


Other (note description/page numbed. 




Scanning Agent Signoff: 

Date Received: l( / 3v 1^5 Date Scanned: /X/ I) / 9S 



Date Returned: 


Scanning Agent Signature: 


Rev 9/04 DS/LCS Document Control Form cstfform.vad 



Scanning Agent Identification Target 


Scanning of this document was supported in part by 

the Corporation for National Research Initiatives, 
using funds from the Advanced Research Projects 
Agency of the United states Government under 
Grant: MDA972-92-J1029. 


The scanning agent for this project was the 
Document Services department of the M.I.T 
Libraries. Technical support for this project was 
also provided by the M.I.T. Laboratory for 
Computer Sciences. 


Scanned 

Date: 

M.I.T. Libraries 
Document Services 


darptrgt.wpw Rev. 9/94 


